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Abstract. Boxicity of a graph G(V, E) is the minimum integer k such 
that G can be represented as the intersection graph of fc-dimensional 
axis parallel rectangles in R fe . Equivalently, it is the minimum num- 
ber of interval graphs on the vertex set V such that the intersection of 
their edge sets is E. It is known that boxicity cannot be approximated 
even for graph classes like bipartite, co-bipartite and split graphs below 
O(n ' 5_6 )-factor, for any e > in polynomial time unless NP = ZPP. 
Till date, there is no well known graph class of unbounded boxicity for 
which even an n e -factor approximation algorithm for computing boxicity 
is known, for any e < 1. In this paper, we study the boxicity problem 
on Circular Arc graphs - intersection graphs of arcs of a circle. We give 
a (2 + -r)-factor polynomial time approximation algorithm for comput- 
ing the boxicity of any circular arc graph along with a corresponding 
box representation, where k > 1 is its boxicity. For Normal Circular 
Arc(NCA) graphs, with an NCA model given, this can be improved to 
an additive 2-factor approximation algorithm. The time complexity of 
the algorithms to approximately compute the boxicity is 0{mn + n 2 ) in 
both these cases and in 0(mn + kn 2 ) = 0(n 3 ) time we also get their 
corresponding box representations, where n is the number of vertices of 
the graph and m is its number of edges. The additive 2-factor algorithm 
directly works for any Proper Circular Arc graph, since computing an 
NCA model for it can be done in polynomial time. 
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1 Introduction 



Boxicity: Boxicity of a graph G(V, E) is defined as the minimum number 
of interval graphs on the vertex set V such that the intersection of their edge 
sets is E. If Ii,I-2 7 - ■ ■ ,1k are interval graphs on the vertex set V such that 
G = I\ PI I2 H • ■ • H ifc, then {Ix, I2, ■ ■ - , ifc} is called a box representation of 
G and k is called the dimension of the representation. Equivalently, boxicity 
is the minimum integer k such that G can be represented as the intersection 
graph of fc-dimensional axis parallel rectangles in R fe . Boxicity was introduced 
by Roberts [15] in 1968. If we have a box representation of dimension k for a 
graph G on n vertices, it can be stored using 0(nk) space, whereas an adjacency 
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list representation will need 0(m) space which is 0(n 2 ) for dense graphs. The 
availability of a box representation in low dimension makes some well known 
NP-hard problems like max-clique, polynomial time solvable|16j. 

Boxicity is combinatorially well studied and many bounds are known in terms 
of parameters like maximum degree, minimum vertex cover size and tree-width 
[4]. Boxicity of any graph is upper bounded by |_5J where n is the number of 
vertices of the graph. It was shown by Scheinerman [T7] in 1984 that the boxicity 
of outer planar graphs is at most two. In 1986, Thomassen [50] proved that the 
boxicity of planar graphs is at most 3. This parameter is also studied in relation 
with other dimensional parameters of graphs like partial order dimension and 
threshold dimension |2l23j . 

However, computing boxicity is a notoriously hard algorithmic problem. In 
1981, Cozzens[5] showed that computing Boxicity is NP-Hard. Later Yannakakis 
|23j proved that determining whether boxicity of a graph is at most three is 
NP-Complctc and Kratochvil|llj strengthened this by showing that determining 
whether boxicity of a graph is at most two itself is NP-Complete. Recently, Adiga 
et.al [2] proved that no polynomial time algorithm for approximating boxicity 
of bipartite graphs with approximation factor less than O(n ' 5_e ) is possible 
unless NP = ZPP. Same non-approximability holds in the case of split graphs 
and co-bipartite graphs too. Even an n e -factor approximation algorithm, with 
e < 1 for boxicity is not known till now, for any well known graph class of 
unbounded boxicity. In this paper, we present a polynomial time (2 + i)-factor 
approximation algorithm for finding the boxicity of circular arc graphs along 
with the corresponding box representation, where k > 1 is the boxicity of the 
graph. There exist circular arc graphs of arbitrarily high boxicity including the 
well known Robert's graph (the complement of a perfect matching on n vertices, 
with n even) which achieves boxicity ^. For normal circular arc graphs, with an 
NCA model given, we give an additive 2-factor polynomial time approximation 
algorithm for the same problem. Note that, proper circular arc graphs form a 
subclass of NCA graphs and computing an NCA model for them can be done in 
polynomial time. We also give efficient ways of implementing all these algorithms. 
Circular Arc Graphs: Circular Arc (CA) graphs arc intersection graphs of 
arcs on a circle. That is, an arc of the circle is associated with each vertex 
and two vertices are adjacent if and only if their corresponding arcs overlap. It is 
sometimes thought of as a generalization of interval graphs which are intersection 
graphs of intervals on the real line. CA graphs became popular in 1970's with a 
series of papers from Tucker, wherein he proved matrix characterizations for CA 
graphs [21] and structure theorems for some of its important subclasses [21]. For 
a detailed description, refer to the survey paper by Lin et.al [12]. Like in the case 
of interval graphs, linear time recognition algorithms exist for circular arc graphs 
too [M]. Some of the well known NP-complete problems like tree- width, path- 
width are known to be polynomial time solvable in the case of CA graphs [T8I19] . 
However, unlike interval graphs, problems like minimum vertex coloring [5] and 
branchwidth [13] remain NP-Complctc for CA graphs. We believe that boxicity 
belong to the second category. 
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A family T of subsets of a set X has the Helly property if for every subfamily 
T' of T . with every two sets in T' pairwise intersecting, we also have C\ A ^ 0. 

Similarly, a family A of arcs satisfy Helly property if every subfamily A' C A of 
pairwise intersecting arcs have a common intersection point. The fundamental 
difficulty while dealing with CA graphs in comparison with interval graphs is 
the absence of Helly property for a family of circular arcs arising out of their 
circular adjacencies. 

A Proper Circular Arc (PCA) graph is a graph which has some CA repre- 
sentation in which no arc is properly contained in another. A Unit Circular Arc 
(UCA) graph is one which has a CA representation in which all arcs are of the 
same length. A Helly Circular Arc (HCA) graph is one which has a representa- 
tion satisfying the Helly property. In a CA representation M, a pair of arcs are 
said to be circle cover arcs if they together cover the circumference of the circle. 
A Normal Circular Arc (NCA) graph is one which has a CA representation in 
which there are no pairs of circle cover arcs. It is known that UCA C PCA C 
NCA and UCA £ HCA £ NCA. 
Our main results in this paper are: 

(a) Boxicity of any circular arc graph can be approximated within a (2 + ^)- 
factor in polynomial time where k > 1 is the boxicity of the graph. 

(b) The boxicity of any normal circular arc graph can be approximated within 
an additive 2-factor in polynomial time, given a normal circular arc model 
of the graph. 

(c) The time complexity of the algorithms to approximately compute the boxic- 
ity is 0(mn + n 2 ) in both the above cases and in 0(mn + kn 2 ) = 0(n 3 ) time 
we also get their corresponding box representations, where n is the number 
of vertices of the graph, m its number of edges and k its boxicity. 

A structural result we obtained in this paper may be of independent interest. 
The following way of constructing an auxiliary graph H* of a given graph H is 
from pp. 

Definition 1. Given a graph H = (V,E), consider the graph H* constructed 
as follows: V(H*) = E(H), and edges wx and yz of H are adjacent in H* if 
and only if {w,x,y, z} induces a 2K2 in H. Notice that H* is the complement 
of [L(H)] 2 , the square of the line graph of H . 

The structural properties of H* and its complement [L(H)] 2 had been exten- 
sively investigated for various graph classes in the context of important problems 
like largest induced matching and minimum chain cover. The initial results were 
obtained by Golumbic et.al [9]. Cameron et.al [3] came up with some further 
results. A consolidation of the related results can be found in [3]. 

The following intermediate structural result in our paper becomes interesting 
in this context: 

(d) In Lemma |4l we observe that if H is a bipartite graph whose complement is 
a CA graph, then H* is a comparability graph. 
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This is a generalization of similar results for convex bipartite graphs and interval 
bigraphs already known in literature |l|24j . This observation helps us in reducing 
the complexity of our polynomial time algorithms. 

2 Preliminaries 

2.1 Notations 

We denote the vertex set of a given graph by V(G) and edge set by E(G), with 
| V{G)\ = n and \E{G)\ = m. We use e to denote min(m, nCi—m). We denote the 
complement of G by G. We call a graph G the union of graphs G±, G2, • • ■ , Gk if 
they are graphs on the same vertex set and E{G) = £ , (Gi)U£ , (G 2 )U- • -UE(Gk)- 
Similarly, a graph G is the intersection of graphs G\, G2, ■• • , Gk if they are 
graphs on the same vertex set and E(G) = E(G±) PI E(G 2 ) H • • • fl E(G k ). We 
use box(G) to denote boxicity of G and x(G) to denote chromatic number of G. 

A circular-arc (CA) model M = (G, A) consists of a circle G, together with a 
family A of arcs of G. It is assumed that G is always traversed in the clockwise 
direction, unless stated otherwise. The arc A v corresponding to a vertex v is 
denoted by [s(v), t(v)], where s(v) and t(v) are the extreme points of A v on G 
with s(v) its start point and t(v) its end point respectively, in the clockwise 
direction. Without loss of generality, we assume that no single arc of A covers 
G and no arc is empty or a single point. 

An interval model / consists of a family of intervals on real line. An interval 
I v corresponding to a vertex v is denoted by a pair \l v (I),r v (I)] , where l v (I) 
and r v (I) are the left and right end points of the interval I v . Without loss of 
generality, we assume that an interval is always non-empty and is not a single 
point. We may use / to represent both an interval graph and its interval model, 
when the meaning is clear from the context. 

Definition 2 (Bi-Consecutive Adjacency Property). Let the vertex set 
V(G) of a graph G be partitioned into two sets A and B with \A\ = n\ and 
\B\ = ra 2 . A numbering scheme where vertices of A are numbered as 1, 2, • ■ • , n% 
and vertices of B are numbered as 1', 2', ■ • • , n' 2 satisfy Bi-Consecutive Adjacency 
Property if the following condition holds: 
For any i £ A and j' € B, if i is adjacent to j' , then either 

(a) j' is adjacent to all k such that 1 < k < i or 

(b) i is adjacent to all k! such that 1 < k' < j' . 

2.2 A Vertex Numbering Scheme for Circular Arc Graphs 

Let G be a CA graph. Assume a CA model M = (G, A) of G is given. Let p be 
any point on the circle G. We define a numbering scheme for the vertices of G 
denoted by NS(M,p) which will be helpful for us in explaining further results. 

Let A be the clique corresponding to the arcs passing through p and let 
B = V \ A. Let \A\ = n\ and \B\ = Number the vertices in A as 1, 2, • ■ • , m 
such that the vertex v with its t(v) farthest (in the clockwise direction) from 
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p gets number 1 and so on. Similarly, number the vertices in B as 1', 2', • • ■ , n' 2 
such that the vertex v' with its t(v') farthest (in the clockwise direction) from 
p gets number V and so on. In both cases, break ties (if any) between vertices 
arbitrarily, while assigning numbers. See Figure [T] for an illustration of the num- 
bering scheme. Now, observe that in G, if a vertex i G A is adjacent to a vertex 




Fig. 1. Example for Numbering of vertices of a CA graph 

f G B, then at least one of the following is true: (a) the point t(i) is contained 
in the arc [s(j'),t(j')] or (b) the point t(j') is contained in the arc [s(i), £(*)]- 
This implies that if i G A is adjacent to j' G B, then either (a) j' is adjacent 
to all k such that I < k < i or (b) i is adjacent to all k' such that 1 < k' < j' . 
Thus we have the following lemma. 

Lemma 1. Given a circular arc graph G and a CA model M(C,A) of G, to- 
gether with a point p on the circle C , let A and B be as described above. 

1. The numbering scheme NS(M,p) ofG defined above satisfy the Bi- Consecutive 
Adjacency Property. 

2. NS(M,p) can be computed in 0(n 2 ) time. 

Using Lcmma[TJ we can prove the following in the case of co-bipartite CA graphs. 

Lemma 2. If G(V, E) is a co-bipartite CA graph, then we can find a partition 
A U B of V where A and B induce cliques, having a numbering scheme of the 
vertices of A and B with A = {I, 2, ■ • ■ , n\} and B — {1', 2', • • • , n' 2 } such that it 
satisfies Bi-Consecutive Adjacency Property. Moreover, the numbering scheme 
can be found in 0(n 2 ) time. 

For a proof of this lemma, refer to Appendix [A"l 

The following lemma is applicable in the case of co-bipartite graphs: 
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Lemma 3. Let G be a co-bipartite graph with a partitioning of vertex set into 
cliques A and B with \A\ = n\ and \B\ = n%- Suppose there exist a numbering 
scheme of vertices of G which satisfies the Bi- Consecutive Adjacency Property. 
Then G is a CA graph. 

The proof is by construction of a CA model M = (C, A) for G. Refer to Appendix 
[3] for the proof. 

3 Computing the Boxicity of Co-bipartite CA Graphs in 
Polynomial Time 

Using some theorems in the literature, in this section we infer that computing 
boxicity of co-bipartite CA graphs can be done in polynomial time. A bipartite 
graph is chordal bipartite if it does not contain any induced cycle of length > 6. 

Theorem 1 (Feder, Hell and Huang 1999 gj]). A graph G is a co-bipartite 
CA graph if and only if its complement is chordal bipartite and contains no 
edge-asteroids. 

A bipartite graph is called a chain graph if it does not contain any induced 2K2 ■ 
The minimum chain cover number of G, denoted by ch(G), is the minimum 
number of chain subgraphs of G such that the union of their edge sets is E(G). 

Recall Definition [1] of H* from Section [TJ 

Theorem 2 (Abueida, Busch and Sritharan 2010 [lj). If H is a bipartite 
graph with no induced cycles on exactly 6 vertices, then 

1. ch{H) = X (H*). 

2. Every maximal independent set of H* corresponds to the edge-set of a chain 
subgraph of H . Moreover, the family of maximal independent sets obtained 
by extending the the color classes of the optimum coloring of H* corresponds 
to a minimum chain cover of H . 

3. In the more restricted case where H is chordal bipartite, H* is a perfect 
graph and therefore, ch(H) and a chain cover of H of minimum cardinality 
can be computed in polynomial time, in view of 1 and 2 above. 

Theorem 3 (Yannakakis 1982 [23j ) . Let G be the complement of a bipartite 
graph H . Then, box(G) = ch{H). Further, if Hi, H2, ■ • ■ , H^ are chain subgraphs 
whose union is H , their respective complements Gi, G%, ■ • ■ , Gk are interval su- 
pergraphs of G whose intersection is G. 

By Thcorcm[TJ if G = H is a co-bipartite CA graph, then H is chordal bipartite. 
Hence by Thcorcm[2l a chain cover of H of minimum cardinality can be computed 
in polynomial time and ch(H) = x{H*). Combining with Theorem [3l we get : 
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Theorem 4. If G is a co-bipartite CA graph, then box(G) = x(H*) and the 
family of maximal independent sets obtained by extending the color classes of an 
optimum coloring of H* corresponds to the complements of interval supergraphs 
in an optimal box representation of G. Moreover, box(G) and an optimal box 
representation of G are computable in polynomial time. 

4 Reducing the Time Complexity of Computing the 
Boxicity of Co-bipartite CA Graphs 

Let t be the number edges of H or cquivalently, the number of vertices in H* . By 
Theorem [21 when H is a chordal bipartite graph, H* is a perfect graph. Using 
the standard perfect graph coloring methods, an 0(t 3 ) algorithm is given in p] 
to compute x(H*). In 0(t 3 ) time, they also compute a chain cover of minimum 
cardinality. However, 0(t 3 ) can be as bad as 0(n 6 ) in the worst case, where n is 
the number of vertices of G. In Q], for the restricted case when H is an interval 
bigraph, they succeeded in reducing the complexity to 0(tn), using the zero par- 
titioning property of the adjacency matrix of interval bigraphs. Unfortunately, 
zero partitioning property being the defining property of interval bigraphs, we 
cannot use the method used in [T] in our case because of the following result 
by Hell and Huang [TO]: A graph H is an interval bigraph if and only if its 
complement is a co-bipartite CA graph admitting a normal CA model. Since 
there arc co-bipartite CA graphs which do not permit a normal CA model, the 
complements of CA co-bipartite graphs form a strict super class of interval bi- 
graphs. Hence to bring down the complexity of the algorithm from 0(t 3 ), we 
have to go for a new method. The key ingredient of our method is the following 
generalization of the results in [1124] . 

Lemma 4. If the complement of graph H is a co-bipartite CA graph, then H* 
is a comparability graph. 

Proof. Let H = G(V,E). Let A U B be a partitioning of the vertex set V as 
described in Lemma [21 where A and B are cliques. Let A = {1, 2, • • • , n{\ and 
B = {1', 2', ■ ■ ■ , n' 2 } be the associated numbering scheme. 

Consider two adjacent vertices of H* corresponding to the edges wx' and yz' 
of H. Since they are adjacent, {w, x', y, z'} induces a 2K 2 in H. Equivalently, 
these vertices induce a 4-cycle in G with edges wy, yx', x'z' and z'w. We claim 
that w < y if and only if x' < z' . To see this, assume that w < y. Since 
yx' e E(G), by the Bi-Consecutive property of the numbering scheme (Lemma 
[U, if z' < x' , yz 1 e E(G) or wx' e E(G), a contradiction. Hence, x' < z'. 

Now, to show that H* is a comparability graph, we define a relation -< as 
ab' -< cd' if and only if a, c £ A, b', d' e B with a < c and b' < d' and {a, b', c, d'} 
induces a 2K2 in H. In view of the claim proved in the paragraph above, if ab' 
and cd' are adjacent vertices of H*, they are comparable with respect to the 
relation -<. 

Let ab' < cd! and cd' -< ef. We have {a, b',c, d'} inducing a 4-cycle in G 
with edges ac, cb', b'd' and d'a. Similarly, {c, d',e, /'} induces a 4-cycle in G 
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with edges ce, ed', d'f and f'c. We also have a < c < e and b' < d! < f , by the 
definition of the relation -<!. By the Bi-Consecutive property of the numbering 
scheme (Lemma H}, cf G E(G) and cd' f E(G) implies that af G E(G). 
Similarly, ed' G E(G) and cd' g E{G) implies that eb' G E(G). Edges ae and 
b'f are parts of cliques A and B. Hence, we have an induced 4-cycle in G with 
edges ae, eb' ', b'f and fa. We can conclude that ab' -< e/'. Thus the relation -< 
is transitive and hence, H* is a comparability graph. □ 

Improved Complexities 

Lemma |4] serves as the key ingredient in improving the time complexities of our 
algorithms. By the definition of H* , a proper coloring of the vertices of H* is 
same as coloring the edges of H such that no two edges get the same color if 
their end points induce a 2K 2 in H or equivalently a 4 cycle in G. Since the 
number of edges in H* may be of 0(t 2 ), where t = \E(H)\, time for computing 
x(H*) might go up to 0(t 2 ) = 0(n 4 ), if we use the standard algorithm for 
the vertex coloring of comparability graphs. Let m AB = n\n<i — t, the number 
of edges between A and B in G. We show that by utilizing the structure of 
G along with the underlying comparability relation on the set of non-edges of 
G defined in the proof of Lemma IH computing the boxicity of G can be done 
in 0(en + ?i 2 ), where e is min(m AB , t). Each color class can be extended to a 
maximal independent set and thus get an optimum box representation of G in 
0(en + fcn 2 ), where k = box(G). The complexities claimed here are obtained 
by a suitable implementation of the greedy algorithm for the vertex coloring of 
comparability graphs, fine tuned for this special case and its careful amortized 
analysis. Due to the structural differences with interval bigraphs as explained 
before, this turned out to be much different from the method used in pQ. For a 
detailed description of the algorithm and its analysis, refer to Appendix |B| 

5 Constant Factor Approximation for the Boxicity of CA 
Graphs 

First we give a lemma which is an adaptation of a similar one given in [2] . 

Lemma 5. Let G(V,E) be a graph with a partition (A, B) of its vertex set V 
with A = {1, 2, • • • , ni} and B = {1', 2', • • • , n' 2 }. Let G\{V, E\) be its supergraph 
such that E 1 = E U {(a', b') : a', b' G B}. Then, box{G 1 ) < 2 • box{G). 

For a proof of this lemma, see Appendix [A} 

Definition 3. Let G(V, E) be an interval graph and L be an interval represen- 
tation of G. Let I = min l u (L) and r = max r u (I). Consider a graph G' (V ,E') 

such that V 2 V and E' = E U {(a, b): a G V \ V and b G V'}. An interval 
representation V of G' obtained by assigning interval [l,r], \/u G V \ V and 
intervals [l u {L),r u (L)\, \/u G V is called an extension of L on V' . 
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Algorithm 1: Find a near optimal box representation of given CA graph 
Input: A circular arc graph G(V, E) 

Output: A box representation of G of dimension at most 2k + 1 where 
k = box(G) 

1 if G is an interval graph then Output an interval representation Iq of G, Exit 

2 Compute a CA model M(C, A) of G 

3 Choose any point p on the circle C 

4 Let A be the clique corresponding to p; B = V \ A 

5 Construct G'(V, E') with E' = E U {(«',?/) :u',v'eB} 

/* G' is a co-bipartite CA graph by Lemma [6] */ 

6 Find an optimum box representation B' = {I{, I2, • • • ,-Tj} of G' 

/* Using the method described in Section [3] */ 

7 Construct an interval representation I for the subgraph induced on B 

/* Induced subgraph on B is clearly an interval graph */ 

8 Construct the extension of / on V 

9 Output B = I2, ■ • ■ , lb, I'} as the box representation of G 



Approximation Algorithm 

A method for computing a box representation of a given CA graph G within a 
(2 + i)-factor where k > 1 is the boxicity of G is given in Algorithm [TJ We use 
the O(en) algorithm for computing boxicity of co-bipartite CA graphs given in 
Section [3] as a subroutine here. Let n = |V(G)| and m = \E(G)\. We can show 
that a near optimal box representation of G can be obtained in 0(mn + kn 2 ). For 
more details, refer to Appendix [A] If we just want to compute the approximate 
boxicity of G, it is enough to output box(G') + 1, as proved below. This can be 
done in 0(mn + n 2 ). 

Proof of correctness: Let us analyze the non-trivial case when G is not an 
interval graph. Otherwise, the correctness is obvious. 

Lemma 6. G' constructed in Line\^ of Algorithm^ is a co-bipartite CA graph. 

Proof. It can be easily seen that G' is a co-bipartite graph on the same vertex 
set as that of G with cliques A and B and V = A U B. Consider a numbering 
scheme NS(M,p) of G as described in Section |2~21 such that A = {1, 2, • • • , ni} 
and B = {1', 2', • • • , n' 2 }, based on the CA model M(C, A) and the point p as 
chosen in Algorithm [T] Notice that by construction of G', for any pair of vertices 
i e A and j' e B, (u,v') G E if and only if (u,v') 6 E' . Recall that the 
numbering scheme NS(M,p) satisfies Bi-Consecutive Adjacency Property for G 
by Lemma [1] Clearly, the same will apply to G' also. Hence by Lemma El we 
can infer that G' is a co-bipartite CA graph. □ 

Lemma 7. The box representation B = {/{, 1%, ■ ■ ■ ,1^,1'}, obtained in Line\lJ\ 
of Algorithm\]]is a valid box representation of G with \B\ < 2 • box(G) + 1. 

Proof. It is easy to see that /' constructed in Line |8] of Algorithm [T] is a super- 
graph of G, since I is an interval representation of the induced subgraph of G 
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on B and I' is an extension of I on V. Since B' is a box representation of G', 
each l[ £ £>', for 1 < i < b is a supcrgraph of G' and in turn of G too. A is 
a clique in G by definition. Consider any (u,v') ^ £7 with u £ A and u' G £>. 
Clearly, (it, i/) ^ 7?' as well and since 23' is a box representation of G', 3i such 
that (u, v) £ E(I-) for some 1 < i < b. For any (u',v') ^ E 1 with u',v' £ B, we 
have « u') £ 75(7'). Thus, G = I' D f] l[. 

l<i<b 

Thus, B = {7{, 7^, • • ■ , 7£,,7'} is a valid box representation for G of size 
box(G') + 1. By Lemma O box(G') < 2 • box(G), implying that 23 is of size at 
most 2 • boz(G) + 1. □ 

Lemma [7] implies that B is a (2+i)-factor approximate box representation where 
k > 1 is the boxicity of G. 



6 Additive 2-Factor Approximation for the Boxicity of 
Normal CA Graphs 

We assume that a normal CA model M(C, A) of G is given. An additive two 
factor approximation algorithm for computing a box representation of normal 
CA graphs is given in Algorithm |21 We can show that in 0(mn + kn 2 ) time, the 
algorithm outputs a near optimal box representation of G where n = \V{G)\ 1 
m = \E(G)\ and k = box(G). Refer to Appendix [A] for more details. If we 
just want to compute the approximate boxicity of G, it is enough to output 
box(H) + 2, as proved below. This can be done in 0(mn + n 2 ). 
Proof of correctness: Since G is a normal CA graph, the set of arcs passing 
through p does not contain any circle cover pair of arcs. Therefore, [p,pi] U 
[P2tP\ does not cover the entire circle G. So, any point in the arc ipi,Pi), in 
particular the point q defined in Line [5] of Algorithm [21 is not contained in any 
arc passing through p. It follows that A R B = 0. Since A and B are cliques, 
H, the induced subgraph on A U B is a co-bipartite CA subgraph of G. We can 
compute an optimum box representation £>' of H in polynomial time using the 
method described in Section [3] 

I A and Ib are interval graphs because they are obtained by removing vertices 
corresponding to arcs in A passing through points p and q respectively. Since 
I A is a supergraph of G on V \ A and I' A is the extension of I a on V, we can 
conclude that I' A is a super graph of G. Similarly, I' B is also a super graph of G. 
Since £>' is a box representation of H, each 7, £ B' is a supergraph of induced 
subgraph H . Since I[ is the extension of U on V, I[ is a super graph of G. 

Consider (u,v) ^ E. Case (i) If u, v £ V \ A, by construction of I' A , 
(u, v) g E(I' A ). Case (ii) If u, v £ V \ B, by construction of I' B , (u, v) E(I' B ). 
Remember that A and B are cliques. If both (i) and (ii) are false, then one of 
{u, v} is in A and the other is in B. Since B' is a box representation of H, 
(u,v) £ E(Ii) for some 1 < i < h = \B'\. By construction of 7-, (u, v) £ E(I-) 
too. Hence, G = I' A n I' B (1 f] I[. Thus we get B = {I' A , I' B , I[ ,7 2 , • • • , I' h } is 

Ki<h 
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Algorithm 2: Find a additive 2-optimal box representation of given nor- 
mal CA graph 

Input: A normal CA graph G(V, E), with an NCA model M (C, A) of G 
Output: A box representation of G of dimension at most k + 2 where 
k = box(G) 

1 if G is an interval graph then Output an interval representation Iq of G, Exit 

2 Choose any point p on the circle C; Let A be the clique corresponding to p 

3 Let pi be the farthest clockwise end point of any arc passing through p 

4 Let p2 be the farthest anticlockwise end point of any arc passing through p 

5 Let q be a point on the arc [pi,p2] with q ^ pi,P2 

6 Let B be the clique corresponding to q 

7 Let H be the induced subgraph on A U B 

/* Clearly, H is a co-bipartite CA graph */ 

8 Find an optimum box representation B' — {Ii, I2, ■ ■ ■ , Ih} of H 

/* Using the method described in Section [3] */ 

9 for i = 1 to h do Construct I[, the extension of h on V 

10 Construct an interval representation I a for the induced subgraph on V \ A 

/* Induced subgraph on V \ A is an interval graph */ 

11 Construct I' A , the extension of I a on V 

12 Construct an interval representation Ib for the induced subgraph on V \ B 

I* Induced subgraph on V \ B is an interval graph */ 

13 Construct I' B , the extension of Ib on V 

14 Output B = {/(, I2, • • ■ , I'h, I At I'b} as the box representation of G 



a valid box representation of G of size box(H) + 2 which is at most box{G) + 2, 
since H is an induced subgraph of G. 

In Algorithm [2l we assumed that an NCA model of the graph is given. This 
was required because recognizing NCA graphs in polynomial time is still an open 
problem. We can observe that though the algorithm of this section is given for 
normal CA graphs, it can be used for a wider class as stated below. 

Theorem 5. If we are given a circular arc model M(C, A) of G with a point p' 
on the circle C such that the set of arcs passing through p' does not contain a 
circle cover pair, then we can approximate the boxicity of G within an additive 
2-j "actor in polynomial time using Algorithm^ 

Proof. In Line [2] of Algorithm [2l select p' (guaranteed by the assumption of the 
theorem) as the point p. Such a point can be found in 0(n 2 ) time, if it exists. 
The rest of the algorithm is similar. □ 

Though such a representation need not exist in general, it does exist for many 
important subclasses of of CA graphs and can be constructed in polynomial 
time; for example, for proper CA graphs or normal helly CA graphs. In fact, 
for these classes, construction of a normal CA (NCA) model itself from their 
adjacency matrices can be done in polynomial time. 

Corollary 1. Boxicity of any proper circular arc graph can be approximated 
within an additive 2- factor in polynomial time. 
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A Appendix 1 
Proof of Lemma [2j 

Let G be a co-bipartite CA graph. Recall that a circular arc model of G is 
constructable in linear time. In any circular arc model M(C, A) of a co-bipartite 
CA graph G, there are two points p\ and P2 on the circle C such that every arc 
passes through at least one of them [22112] . It is easy to see that these points 
can be identified in 0(n 2 ) time. Let the clique corresponding to pi be denoted 
as A. Let B = V\A, which is clearly a clique, since the arcs corresponding to all 
vertices in B pass through p2- Let \A\ = ri\ and \B\ = ri2- Let vertices in A be 
numbered 1, 2, ■ • • , n\ and vertices of B be numbered 1', 2', • ■ • , n' 2 according to 
the numbering scheme NS(M,pi) as described in the beginning of Section [2~2l 
Clearly this numbering scheme satisfies Bi-Consecutive Adjacency Property by 
Lemma [T] □ 

Proof of Lemma [3j 

The proof is by construction of a CA model M(C, A) for G. 
Step 1: Choose four distinct points a, b 1 c, d in the clockwise order on C. Initially 
fix s(i) = a for all i £ A and s(j') ~ c for all f £ B. Choose n\ distinct points 
p ni , p ni _x, • ■ •, pi in the clockwise order on the arc (a, b) and set t(i) = Pi for 
all i £ A. Choose ri2 distinct points p n i , p„ 2 _i/, ■ ■ •, py in the clockwise order 
on the arc (c, d) and set t(j') = py for all j' £ B. As of now, the family of arcs 
that we have constructed represents two disjoint cliques corresponding to A and 
B. 

Step 2: Now we will modify the start points of each arc as follows: Consider 
vertex i £ A. If j' £ B is the highest numbered vertex in B such that i is adjacent 
to all k 1 with 1' < k' < j' , then set s(i) = t(j') = py . Similarly, Consider vertex 
j' £ B. If i £ A is the highest numbered vertex in A such that j' is adjacent to 
all k with 1 < k < i, then set s(j') = t(i) = pi. Notice that we are not making 
any adjacencies not present in G between vertices of A and B in this step. 

Since A and B are cliques, what remains to prove is that if a vertex i £ A is 
adjacent to a vertex j' £ B, their corresponding arcs overlap. Consider such an 
edge If j 1 is adjacent to all k such that 1 < k < i, we would have extended 

s(j') to meet t(i) in Step 2 above. If this does not occur, then by assumed Bi- 
Consecutive Adjacency Property, i is adjacent to all k 1 such that 1 < k' < j' . In 
this case, we would have extended s(i) to meet t(j') in Step 2. In both cases, the 
arcs corresponding to vertices i and j' overlap. We got a CA model of G proving 
that G is a CA graph. □ 

Proof of Lemma [5j 

Let k be the boxicity of G and {7i, I2, ■ ■ ■ , Ik} be an optimal box representation 
of G. For each 1 < i < k, let U = min l u {Ii) and = max r u (Ii). Let 7^ be the 

interval graph obtained from li by assigning the interval [l u (Ii), r u {Ii)\ ,VueA 
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and the interval [Zf, r v i (-Zj)l , W S £?. Let ii 2 be the interval graph obtained from 
Ii by assigning the interval [l u (Ii), r u (Ii)\ ,\/u £ A and the interval [Z„'(/j), r^] , 
Vu' e B. 

Note that, in constructing Ii 1 and ii 2 we have only extended some of the 
intervals of Ii and therefore, 1^ and Ii 2 are super graphs of / and in turn of 
G. By construction, B induces cliques in both 1^ and Ii 2 , and thus they are 
supergraphs of G\ too. 

Now, consider (u,v') ^ E with u G A, v' G _B. Then either rv(ii) < l u {h) 
or r u (ij) < l V '(Ii). If r V '(Ii) < l u {Ii), then clearly the intervals [Zj, rv (7j)] and 
[Z„(Jj), r„(/i)] do not intersect and thus (it, i>') ^ E(Ii x ). Similarly, if r u {Ii) < 
l v '(Ii), then (u,v') £ E(Ii 2 ). If both u, u G A and (it,v) ^ E, then 3i such that 
(u,v) E(Ji) for some 1 < i < k and clearly by construction, (u,v) £ E{Ii x ) 
and (u, v) <£ E(I i2 ). 

It follows that Gi = P) n Ij 2 and therefore, &ox(Gi) < 2 ■ box(G). □ 

i<i<fe 

Time Complexity of the Algorithm of Section [5j 

Let n = | V(G) | and m = \E(G)\. Whether the given graph is an interval graph 
can be determined in linear time. Given any CA graph G(V, E), we can compute 
a CA model M(C, A) for G in linear time [14]. A partition (A, B) of the vertex set 
of G as mentioned in the algorithm can be constructed in 0(n) time from the CA 
model M of G. Construction of G'(V, E') from G(V, E) can also be done in 0(n 2 ). 
Let m AB = \{ab' £ E(G)\a e A and b' G B}\ and m' AB = \{ab' G E(G')\a G A 
and 6' G In Section HI we discussed how to compute the boxicity of G' in 
0(en+n 2 ) time and an optimal box representation of co-bipartite CA graph G' in 
0(en + kn 2 ) where k is the boxicity of G' and e = min(m' 4B , ri\n<i — m' AB ). Since 
by construction of G' , m' AB = m AB < m, the time complexity is 0(mn + kn 2 ). 
The additional work for computing /' in the construction of B can be done in 
0(n). Thus, a near optimal box representation of G is obtained in 0(mn + kn 2 ). 

Time Complexity of the Algorithm of Section [6} 

Whether the given graph is an interval graph can be determined in linear time. 
Choosing point q can be done in 0(n) time. Construction of H from G can also 
be done in 0(m + n). Let k = box(G) and h = box(H). Since H is an induced 
subgraph of G, h < k. In Section HI we discussed how to compute the boxicity of 
H in 0(en' + n' 2 ) and an optimal box representation of co-bipartite CA graph 
H m_0{hen' + hn' 2 ), where ri = \V(H)\ < n and e = min(|B(i7)|, \A\.\B\ - 
\B(H) | ) < m. The additional work for computing the interval supergraphs I' A , I' B 
is 0(n). Construction of /(, ij? • • • ■> I'k horn I\, I2, ■ ■ ■ ,Ih requires only 0(h.n) = 
0(n 2 ) time in total. Thus, in 0(mn + kn 2 ) time, the algorithm outputs a near 
optimal box representation of G. 
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B Appendix 2 - Complexity of Computing the Boxicity 
and Optimal Box Representation of Co-bipartite CA 
graphs 

Let G(V, E) be a co-bipartite CA graph with \E\ = m and |V| = n. Let AU B 
be a partitioning of the vertex set V as described in Lemma [21 where A and 
B are cliques. Let A = {1, 2, ■ • • , rii} and B = {1', 2', • • ■ , n 2 } be the associated 
numbering scheme. Let m AB = \{ab' 6 £7(G)|a G A and 6' G £>} and i = 
ni7T-2 — tj^b = | . Let e = mm(m AB ,t) and fc = box(G). In this section, 

we will show an 0(en + n 2 ) algorithm to compute the boxicity and an 0(en + 
kn 2 ) algorithm to get an optimal box representation of G. Let H = G. Recall 
that by Theorem|H box(G) = \(H*). Let Gi, G2, • • • , Ck be the color classes in 
an optimal coloring of H*. For 1 < i < k, let C[ be a maximal independent 
set containing Ci and £!j = {e G E(H): e corresponds to a vertex in G|}. 
By Theorem |4l {Gi = .Hj : flj = (Vj, -Ei), 1 < i < k} gives an optimal box 
representation of G. 

B.l Computing the Boxicity of G in 0(en + n 2 ) Time 

We call ab' a non-edge of G, if it is an edge of H. Recall that by Lemma IH H* 
is a comparability graph. We had defined a transitive relation -< on V(H*) i.e 
on the non-edges of G, in the proof of Lemma 2] as follows : ab' -< cd' if and 
only if a < c and b 1 < d! and {a, b', c, d'} induces a 4-cycle in G. Since H* is a 
comparability graph, any coloring satisfying the property that the color assigned 
to (the vertex corresponding to) a non-edge cd' equals max Colorle) + 1 

is an optimum coloring [7] of H*. We refer to this as greedy strategy in our 
further discussion. For convenience, hereafter we refer to the coloring of a vertex 
of H* as coloring of the corresponding non-edge of G. 

Assume that the colors available are 1,2, The following definitions are 
with respect to G. For X C V, let N K (v) represent the set of neighbors of v 
in X and N x (v) =X\ N x (v). Similarly, for S CV, N X (S) = (J N x (v) and 

v£S 

N X (S) = [J N x (v). Let deg x (v) denote \N x (v)\. The linked lists corresponding 

to N B (v) and N B (v) for each v G A and N A (v') and N A (v') for each u' G i?, 
with their entries sorted with respect to the numbering scheme described in the 
above paragraph, can be constructed from the adjacency list of G. This can be 
done in overall 0(n 2 ) time. We will assume that lists N A , N A , N B , N B are global 
data structures. 

For igi, we color the non-edges incident on x by invoking Algorithm [3j for 
x = 1, 2, • • • , n\ in that order. For the convenience of our analysis, we refer to an 
invocation of Algorithm [3] for vertex x as the processing of x. Note that, by the 
time a non-edge xy' of G is considered for coloring, i.e, during the processing 
of x, all non-edges tvl such that tu' -< xy' are already colored, since, by the 
definition of -<, t < x and t is processed before x. Consider a non-edge xy' 
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of G. Let F xy > = F y i = {ab' G E(H) : ab' -< xy'}. According to the greedy 
algorithm, the non-edge xy' of G has to get the color maxcolor{F y i) + 1, where 
maxcolor(F„,i) — max Color(ab'). 

y ab'eF y , 

The next question is how to find maxcolor(F y i) efficiently. For that we need 
to understand the set F y > more closely. Let P = {a G N A (N B (x))\a < x} and 
Q = {V G N B (x)\b' < mmN B (x)}. 

Claim. F y , = |+J {ab 1 G E{H) : b' G Q} = {ab' G E{H) : a G N A {y') n P 

a£N A (y>)nP 

and b' G Q}. 

Proof. Since ify = {ab' G E(H) : ab' -< xy'}, we need to show that for any 
a&' G a&' -< xy' if and only if a G iV A (y') n P and b' G Q. Recall that 

ab' -< xy' if and only if a < c, 6' < tf and {a, 6', c, d'} induces a 4-cycle in G. 
Observe that N A (y') Pi P = {a <E N A (y') : a < x}. It is easy to see that if 
a G N A (y') n P and b' G Q, then aft' -< xy'. 

To prove the other direction, assume that ab' -< xy'. Then we have a G 
N A (y'), a < x and therefore, a G N A (y') n P. Similarly 6' G N B (x), b' < y'. 
Suppose minA^ B (a;) < b' . Since the numbering scheme satisfies Bi-Consecutive 
Adjacency Property, xb' G E(G) implies that either (x, min N B (x)) G E{G) or 
ab' G E(G), which is a contradiction. Therefore b' < mmN B (x) and therefore, 

VeQ. □ 

By the above claim, maxcolor(F v i) = max { max {Color(ab')}}. 

a£N A (y>)nP b'£Q,ab'eE(H) 

But, if we have to do this computation separately for each y' G N B (x), then 
for any a G P which is in N A (y') of more than one y' , the computation of 
maxcolorfa) = max {Colorfab')} has to be repeated. To avoid this rep- 

b'EQ,ab'£E{H) 

ctition, in Algorithm[3]we process all non-edges incident at a vertex x G A in par- 
allel as follows. In Lines [6] to [13] of Algorithm[3j for each a G P, maxcolor(a) + 1 
is computed and stored. This is referred to as Type 1 work in the algorithm. 
For each y' G N B (x), Lines H4l to [2X1 referred to as Type 2 work in Algorithm [3] 
computes maxcolor(F y i) + 1 using the values of maxcolor(a) + 1 already com- 
puted and stored as part of Type 1 work. In the process, for each y' G N g (x), 
the algorithm assigns the color maxcolor(F y i) + 1 to xy' , which is the optimum 
color suggested by the greedy strategy. 

Lemma 8. Time spent over all invocations of Algorithm^ is 0(en + n 2 ). 

Proof. Let m AB = \{ab' G E(G)\a G A and b' G B}\ and t = n\ni — m AB = 
\E(G)\. Let e = min(m AB ,£) and k = box{G). 

Type work (Lines [T]to[3|) computes lists Q and R as defined in the algorithm 
and also an indicator array Ap of P. Q and R can be represented as doubly linked 
lists. Initializations in Line [T] can be achieved in 0(en + n 2 ) time as follows: Ap 
can be initialized to in 0(n) once for each x G A. The total time for this work 
is 0(n 2 ). Each y' G B spends at most \N A (y')\ time for the processing of each 
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Algorithm 3: Computing colors of non-edges incident on vertex x G A 
Input: x £ A 

Output: Color(xy') for each y' G N B (x) 

/* Type work : Lines [l] to [3] - Initializations */ 

/* Let P = {a G N A {N B {x))\a < x} */ 

1 For 1 < a < m, let yip [a] = initially. For each a G P, set Ap[a] = 1 and 
color [a] = 

2 Compute Q = {b' G N B (x)\b' < p'}, where p = min (JV B (as)) = N B (x)[l] and 
initialize ptrl[b'} = start of iV 4 (&') for b' G Q 

3 Compute i? = N B (x) and initialize ptr2[r'] — start of N A (r') for r' G -R 

4 for cur = 1 to 7ii do 

5 if .Ap[citr] = 1 then 

/* Type 1 work : Lines \§\ to 1131 - Computing color[cur] — 1+ the 
maximum color given to a non-edge between cur and Q */ 

6 for each q' in Q do 

7 while N A (q')[ptrl[q'}] < cur and not list-end of N A (q') do 

8 Increment the pointer ptr l[q'] 

9 end 

10 if N A (q') \ptr 1 [?']] = cur then 

11 coior[citr] = Meta^coZorfeur], Coloricur q') + 1) /* non-edge 
(cur q') is already colored */ 

12 else if list-end of N A (q') then delete q' from Q 

13 end 

/* Type 2 work : Lines 1141 to 1211 - Identify non-edges at x 
affected by non-edges between cur and Q and update their 
colors if necessary */ 

14 for each r' in R do 

15 while N A (r')[ptr2[r']] < cur and not list-end of N A (r') do 

16 Increment the pointer ptr2[r'] 

17 end 

18 if N A (r')[ptr2[r']} = cur then 

19 if Color(xr') < color[cur] then C'olor(xr') — color[cur] 

20 else if list-end of N A (r') then delete r' from R 

21 end 

22 end 

23 end 
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x £ N A (y 1 ). Thus the total time spent by all y' £ B together for this initialization 
is deg A (y')(ni — deg A (y')) = 0(en). Initialization of Q in Line [2] can be done 

y'EB 

in 0(deg B (x)). Summing over all x £ A, this amounts to 0{m AB ) = 0{m) work. 
Similarly, for initializing i? in Line [3J we need 0(ni — deg B {x)). Summed over 
all x £ A, this amounts to 0(t) work. Adding all the above, total cost of Type 
work (over all invocations of Algorithm [3j is 0(en + n 2 + m + t) = 0(en + n 2 ), 
since m + t — 0(n 2 ). 

Let us calculate the total cost spent in Type 1 work. Note that each ele- 
ment q' £ Q remembers the pointer position ptr\[q'\. This means that ptrl[q'] 
continues from where it stopped in the current iteration, while doing the Type 
1 work of the next element of P. Therefore, pointer ptrl[q'\ moves at most 
n\ — deg A {q') times for each q' £ Q. When ptrl[q'] reaches the end of list 
N A (q'), q' is deleted from the linked list Q. This makes sure that Line [7] is 
repeated just 0(ni — deg A (q')) times for each q' £ Q. Each q' executes Line 
ITT1 whenever cur £ N A (q'). This also happens ri\ — deg A (q') times during the 
processing of each x where x £ N A (q'). Hence the total cost for Type 1 work 
(over all invocations of Algorithm [3]) is ^ (m — deg A (b')).deg A (b') = 0(en). 

b'EB 

Now consider Type 2 work. Note that each element r' € R remembers 
the pointer position ptr2[r']. This means that ptr2[r'] continues from where 
it stopped in the current iteration while doing the Type 2 work of the next el- 
ement of P. Therefore, pointer ptr2[r'] moves at most deg A (r') times for each 
r' 6 R = N B (x). When ptr2\r'] reaches end of list N A (r 1 ), r' is dropped from the 
linked list R. This makes sure that Line Qj)] is repeated only 0(deg A (r')) times 
for each r' £ R, while processing an x such that x £ N A {r'). Also, Line IT51 is 
executed only when cur £ N A {r'). This happens deg A (r') times during the pro- 
cessing of each x, where x £ N A (r r ). Summing up, the total cost for Type 2 work 
(over all invocations of Algorithm [3]) is deg A (b').(ni — deg A {b')) = 0(en). 

b'EB 

Thus the total cost spent over all invocations of Algorithm [3] is 0(en + n 2 ) 
as claimed. □ 



B.2 Expanding Color Classes of H* to Maximal Independent Sets 
in 0(en + kn 2 ) 

In this section, we describe an algorithm which computes an optimal box rep- 
resentation B = {Gi, G2, • • • , Gfc} of G as explained in the beginning of Section 
[El where k is the maximum number of colors used by the algorithm of Section 
IB. II Since edges of H correspond to vertices of H* by defintion, we do not dif- 
ferentiate between a non-edge ab' £ E(H) and its corresponding vertex in H* in 
the following discussions in this section. Recall from the beginning of Section [5] 
that we can compute B by computing C' iy for 1 < i < k, where C[ is a maximal 
independent set containing C, - the i th color class in the optimal coloring of H* 
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obtained by the algorithm of Section fB.il The following lemma suggests one way 
to compute these maximal independent sets. 

Lemma 9. Let Ci be the i th color class in the optimal coloring of H* obtained 
by the algorithm of Section \B.1\ Let Si = (J C, and let MaxSi be the set of 

\<j<i 

maximal elements of (Si, -<), i.e., MaxSi = {ab' £ Si : $cd' £ Si with ab' -< cd'}. 
Then MaxSi is a maximal independent set in H* containing Ci. 

Proof. MaxSi, being the set of maximal elements of (Si,~<), forms an inde- 
pendent set in H* . Recall that, as per our algorithm, for any ab' £ E(H), 
Color(ab') = max Color(e) + 1. Consider ab' £ Ci. If Bed' : ab' -< cd' , 

e£E(H):e-<ab' 

then Color(cd') > Color(ab') = i and therefore, cd' £ Si. Hence, by the defini- 
tion of MaxSi, ab' £ MaxSi. Thus, d C MaxSi. 

Consider any ab' £ MaxSi. Either ab' £ (Si \ MaxSi) or ab' £ Si. In the 
former case, Bed' £ MaxSi with ab' ~< cd' . In the latter case, when ab' Si, 
Color(ab') > i and it is easy to see from our coloring strategy that Bed' £ C, C 
MaxSi with cd' -< ab'. Therefore, in both cases, if ab' is added to MaxSi, it will 
no longer be an independent set. Thus, MaxSi is a maximal independent set 
containing Ci. □ 

The next question is to efficiently compute MaxSi, for 1 < i < k. For this 
purpose, we introduce the following definition. 

Definition 4. For each ab' £ E(H), let 

{min {Color(e)}, if Ele £ E(H) such that ab' < e 

eeE(H),ab'~<e 
k + 1, otherwise 

Lemma 10. For 1 < i < k, MaxSi = {ab' £ Si : Next(ab') > i} 

Proof. If ab' £ MaxSi, then Bed' £ Si with ab' -< cd' . It will follow that 
Next(ab') < Color(cd') < i. Conversely, if Next(ab') < i, then Bed' £ Si with 
ab' -< cd' and hence a&' ^ MaxSi. d 

Let Next(ab') for all a&' £ E(H) be initialized to fc + 1. This can be done in 
0(\E(H)\) = 0(n' 2 ). Consider the following strategy. Take a non-edge e £ E(H) 
and update Next(ab') of all a&' -< e with min(A r ext(a&'), Color(e)). When we 
have repeated this for all e £ E(H), it is easy to see that the values of Next(ab') 
for every ab' £ E(H) will satisfy Definition |4j 

Here, we show that we can do this in 0(en + n 2 ). In order to achieve this, 
we process the non-edges incident at a vertex x £ A together, in an invocation 
of Algorithm |4] - hereafter called the processing of x. During the processing of 
x, each non-edge xy' incident at x updates Next(ab') of all ab' -< xy' with 
mm(Next(ab') , Color (xy')) . We will process x = 1, 2, • • • , n\ in that order. The 
data structures used are similar to those used for Algorithm fj| 

Consider an x £ A. As in Section TB.ll let P = {a £ N A (N B (x))\a < x], 
Q = {b' £ N B (x)\b' < minA> B (a;)} and F y > = {ab' £ E(H) : ab' -< xy'}. Let 
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T x = (J {ab' G £(if) : o6' -< xy'} = [j F y >. Observe that, for any 

y'eN B (x) V'eN B (x) 

ab' £ E{H) \ T x , the value of Next(ab') does not depend on the colors assigned 
to non-edges incident at x. By the claim proved in Section IB. 11 F y > = {ab' £ 
E{H) : a £ N A (y') n P and b' £ Q}. Hence, T x = {ab' £ : a £ P and 

6' £ Q}. Hence, during the processing of x £ A, we just need to update the Next 
values of non-edges between P and Q only. 

Consider any ab' £ T x . The set of non-edges incident at x whose colors can 
affect the value of Next(ab') belong to the set {xy' : y' £ N B (x) and ab' -< xy'} 
= {xy' : y' £ N B (x) (~l N B (a)}. Notice that for any fixed a £ P, this set is 
independent of which b' £ Q is being considered. Let us denote this set by U a . For 
any non-edge ab' £ E(H) with a £ P and b' £ Q, Next(ab') < min {Col or (e)}. 

Hence, we can make the following inference, which is critical for the efficiency of 
Algorithm [4j 

Fact. For a fixed vertex a £ P, for any non-edge ab' £ E{H) between a and 
Q, we just need to update Next(ab') with mhi(Next(ab'), MinColor[a\), where 
MinColor[a] = min {Col or (e)}, irrespective of which b' £ Q is involved. 

In Algorithm 0] Type 1 work computes MinColor[a] for every a £ P. Type 
2 work updates Next(ab'), for each a&' £ T x with min(A^ea;t(a6'), MinCoZor[a]). 
By the time we have processed all x £ A, all non-edges e £ E(H) get processed 
and hence Next(ab') for each a6' £ E(H) is correctly computed. 

Lemma 11. Time spent over all invocations of Algorithm^is 0(en + n 2 ). 

Proof. Type work done by Algorithm @] (Lines [T] to [3]) is similar to the Type 
work of Algorithm [3] and hence the total cost of Type work over all invocations 
of Algorithm 0] is 0(en + n 2 ). 

Let us calculate the total cost spent in Type 1 work. Note that each element 
r £ R remembers the pointer position ptrl[r']. This means that ptrl[r'] contin- 
ues from where it stopped in the current iteration while doing the Type 1 work of 
the next element of P. Therefore, pointer ptrl[r'] moves at most deg A (r') times 
for each r' £ R = N B (x). When ptr\[r'] reaches end of list N A (r'), r' is dropped 
from the linked list R. This makes sure that Line[7]is repeated only 0(deg A (r')) 
times for each r' £ R, while processing an x such that x £ N A (r'). Also, Line [Til 
is executed only when cur £ N A (r'). This happens deg A (r 1 ) times during the pro- 
cessing of each x, where x £ N \ (r'). Summing up, the total cost for Type 1 work 
(over all invocations of Algorithm |4]) is deg A {b').{n\ — deg A {b')) = 0(en). 

b'eB 

Now consider Type 2 work. Note that each element q' £ Q remembers 
the pointer position ptr2[q']. This means that ptr2[q'] continues from where 
it stopped in the current iteration, while doing the Type 2 work of the next 
element of P. Therefore, pointer ptr2[q'] moves at most n\ — deg A (q') times for 
each q' £ Q. When ptr2[q'\ reaches the end of list N A (q'), q' is deleted from the 
linked list Q. This makes sure that Line [15] is repeated just 0{n\ — deg A (q')) 
times for each q' £ Q. Each q' executes Line IT9l whenever cur £ N A (q'). This also 
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Algorithm 4: Each non-edge xy' incident at vertex x G A updates 
Next(ab') of all non edges ab' -< xy' 
Input: x £ A 

Output: The updated Next(ab') for each non edges ab' -< xy' where y' G N B (x) 
/* Type work : Lines [TJ to \S\ - Initializations */ 
/* Let P = {a e N A (N B (x))\a< x} */ 

1 For 1 < a < m, let j4p[a] = initially. For each a G P, set Ap[o] — 1 and 
MinColor[a] = k + 1 

2 Compute Q = {b' G jV B (x)|6' < p'}, where p' = min (JV B (as)) = iV B (ar)[l] and 
initialize pir2[6'] = start of iV A (6') for b' G Q 

3 Compute i? = N B (x) and initialize ptrl[r'] = start of N A (r') for r' G -R 

4 for cur = 1 to 7ii do 

5 if Apfcwr] = 1 then 

/* Type 1 work : Lines \§\ to 1131 - Computing MinColor[cur] = the 
minimum color given to a non-edge between x and N B (cur) n R 
*/ 

6 for each r' in R do 

7 while iV^ (r')[ptrl[r']] < cur and not list-end of N A (r') do 

8 Increment the pointer ptrl[r'] 

9 end 

10 if N A (r')[ptrl[r']] = cur then 

11 MinColor[cur] — min(MinColor[cur], Color(xr')) 

12 else if list-end of N A (r') then delete r' from R 

13 end 

/* Type 2 work : Lines 1141 to 1211 - Update Next of non-edges 

between cur and Q */ 

14 for each q in Q do 

15 while N A (q') [ptr 2 [q'\] < cur and not list-end of N A (q') do 

16 Increment the pointer ptr2[q'] 

17 end 

18 if N A (q')[ptr2[q']} = cur then 

19 Next(cur q') — min(Next(cur q'), MinColor[cur]) 

20 else if list-end of N A (q) then delete q' from Q 

21 end 

22 end 

23 end 
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happens n\ — deg A (q') times during the processing of each x where x € N A (g'). 
Hence the total cost for Type 2 work (over all invocations of Algorithm 0|) is 

(n 1 -deg A (b')).deg A (b') = 0(en). 

b'GB 

Thus the total time spent over all invocations of Algorithm [4] is 0(en + n 2 ) 
as claimed. □ 

Once Next(ab') for (vertex of H* corresponding to) each ab 1 £ E(H) is 
correctly computed by invoking Algorithm |4] for each x £ A, we compute 
MaxSi = {ab' e S l : Next(ab') > i} = {ab' e E{H) : Color{ab') < i and 
Next(ab') > i}, for 1 < i < k. This can be done in overall 0(k.\E(H)\)=0(k.n 2 ) 
time. For 1 < i < k, let Ei = {e G E{H): e corresponds to a vertex of H* in 
MaxSi}. As mentioned in the beginning of Section[B] {&, = Hi : Hi = (V,Ei), 
1 < i < k}, gives an optimal box representation of G. Since each d can be 
computed from Ei in 0(n 2 ), the overall running time of computing the box 
representation using EiS is 0(kn 2 ). 

Thus, the total time used for the algorithm for computing an optimal box 
representation of G is 0(en + kn 2 ) as claimed. 



